Multiple message retrieval for secure electronic communication

ABSTRACT

Techniques for multiple message retrieval for secure electronic communication are described. The techniques, for instance, utilize a server and computing devices employing a private information retrieval scheme to allow a receiving device to locate multiple electronic communications on a server, request delivery of the multiple electronic communications without the server being aware of which electronic communication are requested, and receive the electronic communications without the server being aware of which electronic communications were sent. For example, the server may utilize an efficient electronic communication storage structure for storing and retrieving multiple electronic communications.

BACKGROUND

Keeping communication private has become increasing important in an era of mass surveillance and cyberattacks. For example, government officials and business leaders travel to hostile locations and rely on local, untrusted infrastructure to communicate using mobile devices and computers. Additionally, even in friendly locations, servers and internet service providers may be untrusted. Many people are also concerned that communication is being monitored by private parties and government agencies. Not only are people concerned with content of communication being discovered, but metadata surrounding the communication as well. Metadata may contain information about the communication including identities of parties that sent and received the communication, a time of the communication, and a frequency of the communication. Although communication may be encrypted to protect content of the communication, maintaining privacy of associated metadata is an increasingly important challenge.

Conventional methods attempting to protect metadata of communication are either insufficiently secure or limited by network and other computing costs. When a user attempts to receive a high volume of communication through a sophisticated secure communication system, the user may be compelled to bear high network or other computing costs and/or delay receiving communications because of the high network or other computing costs.

SUMMARY

This Summary is provided to introduce a selection of concepts in a simplified form that are further described below in the Detailed Description. This Summary is not intended to identify key features or essential features of the claimed subject matter, nor is it intended to be used as an aid in determining the scope of the claimed subject matter.

Techniques for multiple message retrieval for secure electronic communication are described. The techniques, for instance, utilize a server and computing devices employing a private information retrieval scheme to allow a receiving device to locate multiple electronic communications on a server, request delivery of the multiple electronic communications without the server being aware of which electronic communication are requested, and receive the electronic communications without the server being aware of which electronic communications were sent. For example, the server may utilize an efficient electronic communication storage structure for storing and retrieving multiple electronic communications.

BRIEF DESCRIPTION OF THE DRAWINGS

The detailed description is described with reference to the accompanying figures. In the figures, the left-most digit(s) of a reference number identifies the figure in which the reference number first appears. The use of the same reference numbers in different instances in the description and the figures may indicate similar or identical items.

FIG. 1 is an illustration of an example environment in accordance with one or more implementations.

FIG. 2 is an illustration of an example local computing device in accordance with one or more implementations.

FIG. 3 is an illustration of an example server in accordance with one or more implementations.

FIG. 4 is an illustration of an example media storage module in accordance with to one or more implementations.

FIG. 5 is an illustration of a mailbox bin according to one or more implementations.

FIG. 6 is an illustration of an example media storage module in accordance with one or more implementations.

FIG. 7 is a flow diagram that describes operations in a method in accordance with one or more implementations.

FIG. 8 is a flow diagram that describes operations in a method in accordance with one or more implementations.

FIG. 9 is a flow diagram that describes operations in a method in accordance with one or more implementations.

FIG. 10 illustrates an example system including various components of an example device that can be implemented as any type of computing device as described and/or utilize with reference to FIGS. 1-9 to implement aspects of the techniques described herein.

DETAILED DESCRIPTION

Overview

Conventional methods of electronic communication rely on data encryption to protect content of the electronic communication. Although encrypting the content of electronic communication is important, it may also be important to protect metadata relating to identification of a relationship of a sender and receiver, timing of a communication between the sender and receiver, frequency of communication between the sender and receiver, and so forth. Some attempts to securely communicate while protecting metadata rely on a trusted internet service provider (ISP) or proxy server to protect the metadata. However, when a trusted server is not accessible, metadata will not be protected. Additionally, governments and other parties may force a trusted ISP or proxy server to reveal the protected metadata, thus reducing confidence in relying on an ISP.

Techniques are described to efficiently exchange a plurality of electronic communications and protect metadata surrounding the electronic communications. Examples of metadata include data relating to identification of a relationship of a sender and receiver, timing of a communication between the sender and receiver, and frequency of communication between the sender and receiver, and so forth. This may be accomplished using a multi-retrieval scheme with a private information retrieval scheme such that a receiving device is able to locate and retrieve a plurality of electronic communications from a server while protecting the metadata from the server including which electronic communications were located and retrieved.

In an example implementation, a plurality of sending devices send at least one encrypted message having at least one label to a server and for receipt by a receiving device. The label is appended to the encrypted message to identify the encrypted message so that the encrypted message is retrievable by an intended recipient. In comparison with conventional communications of messages where a message is identified with one or more of a sender or an intended recipient, using a label does not communicate an identity the intended recipient. The server then stores each of the encrypted messages into at least one mailbox having a corresponding label. The mailboxes are sorted into mailbox bins of the server defined by a range of mailbox labels that are stored therein. The mailbox bins are indexed into hierarchal data structures, such as binary search trees based on labels of mailboxes stored therein. An index matrix, or vector, is generated by the server to represent locations of each of the mailboxes in the mailbox bins. In some implementations, a single encrypted message has two labels and is stored in two different mailboxes having corresponding labels. In some of these mailboxes, the labels are generated such that the two different mailboxes will be stored in two different bins.

According to one or more implementations, a receiving device sends a query to the server to locate a target mailbox in each of a plurality of the mailbox bins. For instance, before sending the query, the receiving device discovers the ranges of labels defining each of the mailbox bins. In at least some implementations, the receiving device knows the labels of target mailboxes, based on a shared secret or shared key with a sender of each electronic communication stored in the target mailboxes, and the receiving device is able to target a plurality of bins for a target mailbox in each mailbox bin. The query, for example, includes querying each mailbox bin of the plurality of mailbox bins, even if a mailbox bin does not contain a target mailbox, further hiding identities of target mailboxes and protecting metadata.

In some implementations, the query includes a series of probes that continue to probe after the indexed location of the electronic communication has been found in a mailbox bin. Continuing to probe may be helpful in protecting metadata by preventing the server from determining which encrypted message the receiving device is attempting to locate.

According to various implementations, after determining an indexed location of a target mailbox in a bin, the receiving device sends a query matrix, or query vector, to the mailbox bin. The receiving device, for instance, may send a set of query matrices including a query matrix directed to each mailbox bin. Each of the query matrices are configured with encrypted entries that encrypt to zero at all entries except for a location corresponding to the target mailbox in each of the plurality of mailbox bins. This query vector generation process uses an encryption system that is additively homomorphic. That is, the encryption system allows the server to perform an operation on two ciphertexts (e.g., Enc(a) and Enc(b)) to obtain a ciphertext that corresponds to the sum of those encrypted values Enc(a+b). Furthermore, such an encryption system allows computing Enc(c*a) starting with c and Enc(a) via repeated addition of Enc(a) with itself c times, enabling the computation of a dot product between an encrypted vector and a plaintext vector. Thus, each query matrix is configured such that when the query matrix is multiplied with the index of the mailbox bin to which it is sent, the content of all but the target mailbox is filtered. In an example implementation, the query matrix is in vector form (“query vector”) and a dot product is performed to multiply the query vector and the index of the mailbox bin to which it is sent. The result of multiplying the query matrix and the index of the mailbox bin is then delivered to the receiving device. Because the server is unable to decrypt the entries of the query matrix, the server is unaware of which mailbox was used to store the delivered encrypted message. Thus, the encrypted message is received from the sending device and delivered to the receiving device without the server being able to detect a relationship between the sending device and the receiving device.

In the following discussion, an example environment is first described that may employ the techniques described herein. Example procedures are then described which may be performed in the example environment as well as other environments. Consequently, performance of the example procedures is not limited to the example environment and the example environment is not limited to performance of the example procedures. Finally, an example system and device are described that may be employed to perform various techniques described herein.

Example Environment

FIG. 1 illustrates an environment 100 in an example implementation for securely communicating while protecting metadata using multi-retrieval that is operable to employ the techniques as described herein. The illustrated environment 100 includes sending computing devices 102-1, 102-2, and 102-3 for creating or selecting media content for sending as a secure communication to a server 104. By way of example, and not limitation, computing devices 102-1, 102-2, and 102-3 may be any combination of computing devices such as laptop computing devices, tablet computing devices, mobile phones, desktop computers, or wearable smart devices.

The server 104 includes media storage module 106 comprising mailbox bin 108(1) through mailbox bin 108(n) for storing mailboxes 110(1) through 110(n). During a sending phase, or after a sending phase has completed, electronic communications 112 are stored in mailboxes 110(1)-110(n), each having a label 114 corresponding to a label 114 appended to the electronic communication stored therein. In some implementations, the server 104 generates mailboxes 110(1)-110(n) in response to receiving the electronic communications 112. The mailboxes 110(1)-110(n) are then stored in one of mailbox bins 108(1)-108(n).

Mailbox bins 108(1)-108(n) may be defined by label ranges of the mailboxes 110(1)-110(n) stored therein. For example, the mailbox bin 108(1) may be defined as a mailbox bin for storing mailboxes 110(1) ranging from 1-100, mailbox bin 108(2) may be defined as a mailbox bin for storing mailboxes 110(2) ranging from 101-200, and mailbox bin n may be defined as a mailbox bin for storing mailboxes 110(n) from 201-300.

The server 104 is configured to communicate over a network 116 to receive electronic communications 112 from computing devices 102-1, 102-2, and 102-3 and send electronic communications 112 to a receiving computing device 118. Receiving computing device 118 is configured to retrieve the electronic communications 112 using a multi-retrieval system in connection with a private information retrieval scheme including submitting a query 120 via the network 116 to request and locate the contents of target mailboxes in a plurality of mailbox bins 108(1)-108(n) based on labels 114 while protecting metadata surrounding the electronic communications.

Sending computing devices 102-1, 102-2, and 102-3 and computing device 118 may be configured as full resource devices with substantial memory and processor resources (e.g., personal computers, game consoles, or terminal computing devices), a mid-resource device with moderate memory and resources (e.g., a netbook), or a low-resource device with limited memory and/or processing resources (e.g., mobile devices, automobile computing devices, or wearable smart devices). Server 104 may be representative of one or a plurality of different devices, such as multiple servers utilized by a business to perform operations “over the cloud” as further described in relation to FIG. 10.

FIG. 2 illustrates an example implementation of one or more of computing devices 102-1, 102-2, and 102-3 of FIG. 1, which represent implementations of the sending computing device 102, and may also represent an example implementation of computing device 118 of FIG. 1. Sending computing device 102 is illustrated with six examples devices: a laptop computer 102-1, a tablet computer 102-2, a smart phone 102-3, a set-top box 102-4, a desktop computer 102-5, and a gaming device 102-6, though other computing devices and systems, such as servers and netbooks, may also be used.

Sending computing device 102 includes or has access to computer processor(s) 202, computer-readable storage media (“media”) 204, communication hardware 206, and I/O interfaces 208. Examples of the I/O interfaces 208 include a display, a touch sensitive input, a keyboard, a gaming remote, etc. Media 204 includes an operating system 208, media creation module 212, encryption/decryption module 214, label generation module 216, and query module 218.

Generally, when sending one or more of electronic communications 112 by computing device 102, a user may interact with I/O interfaces 208 to select or generate electronic media for sending. In at least some implementations, operating system 210 facilitates using media creation module 212 to create a message, document, drawing, presentation, spreadsheet or any other form or combination of electronic media for sending as electronic communication 112. Alternatively or additionally, operating system 210 facilitates choosing existing electronic media as part of electronic communication 112.

Once electronic communication 112 is created or selected, it is encrypted using encryption/decryption module 214 and at least one a label 114 is generated to identify the electronic communication 112. In at least some implementations, the encryption is performed using an encryption key that is generated based on a known secret shared between computing device 102-1, 102-2, or 102-3 and computing device 118. To decrypt a message using the encryption/decryption module, computing device 118 uses a decryption key corresponding to the encryption key that may also be based on the shared secret. After encryption, content of the electronic communication 112 is considered secure.

Sending computing device 102 then sends the encrypted electronic communication 112 to the server 104 along with at least one corresponding label 114 generated by label generation module 216. In some implementations, electronic communications 112 are sent with two distinct labels 114 that are configured to result in one copy of one of electronic communications 112 being stored in one mailbox bin and another copy of the one of electronic communications 112 being store in another mailbox bin of media storage module 106. In some implementations, labels 114 are generated based on the shared secret between one or more of computing devices 102-1, 102-2, or 102-3 and computing device 118 so that computing device 118 is aware of the label 114 for which it is looking. According to various implementations, the electronic communications 112 are sent to the server 104 with a label 114 that does not identify an intended recipient. Accordingly, metadata surrounding the electronic communication 112 is protected from the server 104 including metadata relating to identification of a receiver and a relationship existing therewith between a sender and a receiver.

FIG. 3 illustrates an example implementation of the server 104 in further detail. The server 104 is shown as a singular entity for visual brevity, though multiple devices may instead be used. The server 104 includes or has to access to processor(s) 302 computer-readable storage media 304 (server media 304), and communication module 306. Media 304 includes or has access to a media storage module 106 and a matrix operations module 308, and the media storage module 106 includes mailboxes indices 310 and mailboxes bins 108(1)-108(n) (including mailbox bin 108(2)).

In some implementations, mailbox bins 108(1)-108(n) are partitioned into mailbox bins having static label space values. Alternatively or additionally, mailbox bins 108(1)-108(n) are configured after a predetermined time has elapsed for sending electronic communications (a sending phase) such that a quantity of mailboxes in each mailbox bin are roughly equal.

According to one or more implementations, each mailbox bin is configured to store all mailboxes within the range of labels defining the mailbox bin. In some implementations, a quantity of mailbox bins is equal to a quantity of electronic communications 112 that computing device 118 is permitted to receive from the server 104 in a single receive phase. For instance, mailboxes indices 310 updates index entries for each of the mailbox bins based on the electronic communications 112 received, examples of which are illustrated in FIGS. 4-6. As shown in FIGS. 5-6, some implementations of the mailboxes indices 310 order the mailbox bins 108(1)-108(n) into a hierarchal structure such as a binary search tree to facilitate efficient and private locating of a target mailbox. In some implementations, as shown in FIG. 6, electronic communications 112 may be sent with more than one label 114, and thus may be stored in more than one mailbox bin. Once a configuration of mailbox bins 108(1)-108(n) is complete, the server 104 may send mailbox bin information to users including quantities of mailboxes stored in each of the mailbox bins 108(1)-108(n).

Alternatively or additionally, mailbox bins 108(1)-108(n) may be configured using batch codes. In an example implementation of using sub-cube batch codes, a candidate batch code, in generating mailbox bins, the server 104 includes a first mailbox bin and a second mailbox bin, e.g., 108(1) and 108(2), with the first mailbox bin storing a first half of mailboxes sorted alphanumerically by respective labels 114 and the second mailbox bin storing a remaining second half of mailboxes sorted alphanumerically by respective labels 114. The first and second mailbox bins are organized into binary search trees to facilitate querying. Further, the server 104 generates a third mailbox bin, e.g., 108(n), and structures the third mailbox bin b₃ as a binary search tree from the first and second mailbox bins as follows: For every level i and index j, b₃(i,j)=b₁(i,j)⊕b₂(i,j), where b₁ is the first mailbox bin and b₂ is the second mailbox bin. In this configuration, b₃ is not ordered, so it is not a binary search tree and cannot be user directly for querying. The server 104 then communicates a lowest label of b₂ to computing device 118 so that computing device 118 can determine whether two target mailboxes are contained in a same mailbox bin b₁ or b₂. If the two target mailboxes are in different mailbox bins, computing device 118 queries each of b₁ and b₂ to locate and retrieve the two target mailboxes. If the two target mailboxes are in the same bin, for example, b₁ computing device 118 can locate and retrieve one mailbox from b₁ and the other from b₃ using joint tree traversal as part of a query.

To perform joint tree traversal, a computing device 118 retrieves a pair of labels 114 and encrypted content (a tuple) from the first level of b₁ and b₃. While the tuples of the first level b₁ and b₃ may be irrelevant to the query, they may be used to compute {Label, content}=(tuple)₁⊕(tuple)₃. The computing device 118 can then compare Label₂ and Label to determine whether to probe up or down in the next level of b₃. If Label₂=Label computing device 118 has located the target encrypted message in b₃. Sub-cube batch codes, such as the example discussed, may be combined with the multiple mailbox bin structure disclosed herein to increase a quantity of mailboxes that can be retrieved from the server 104 without repeating a query process.

Returning again to local computing device 102 of FIG. 2 as an example illustration of computing device 118, computing device 118 sends the query 120 using query module 218 and communication hardware 206 via the network 116 to the server 104, and the server 104 receives the query 120 via the communication module 306. A first component of the query 120 is used to locate a target mailbox in each mailbox bin storing a target mailbox. In some implementations, before sending the query 120, computing device 118 may determine a mailbox content retrieval strategy for locating and requesting target mailboxes in one iteration of query based on mailbox bin information such as ranges of labels 114 stored in each mailbox bin. For example, if a first electronic communication is stored in mailboxes located in a first and third mailbox bin, a second electronic communication is stored in mailboxes located in a first and second mailbox bin, a third electronic communication is stored in mailboxes located in a second and fourth mailbox bin, and a fourth electronic communication is stored in mailboxes located in the first and third mailbox bin, then the receiving device may determine that it will probe for the first electronic communication in the first mailbox bin, the second target electronic communication in the second mailbox bin, the third electronic communication in the fourth mailbox bin, and the fourth electronic communication in the third mailbox bin.

Locating a target mailbox in each mailbox bin storing a target mailbox may include a request for all mailbox indices 310. However, when the mailbox indices are very large, this may not practical because of the computing cost, and possible monetary cost, of sending high volumes of data via network 116. Alternatively or additionally, the first component of the query 120 may include a series of probes, the probes being configured to locate the target mailbox and determine its indexed location within its mailbox bin based on the label 114 of the target mailbox. Such probes may use a private information retrieval protocol (discussed below) to avoid leaking to the server an identity of a probed mailbox. Generally, a probe involves choosing a location of the index as an initial “guess,” privately retrieving a label 114 at that location (without revealing to the server which label was retrieved, and refining the guess for a following probe until a target mailbox is located. A probe process, for instance, represents a search algorithm that searches using a label 114 to locate a target mailbox. In some implementations where the mailbox bins 108(1)-108(n) are organized into a hierarchal structure, the first component of the query 120 may also include a request for the highest levels of the hierarchal structure for more efficient probing. For instance, an index of a level of hierarchal structure, e.g., a binary search tree, may be derived from information gathered from the result of a previous probe and semantics of the hierarchal data structure

In at least some implementations, each probe of the query 120 may be directed based on a result of a previous probe and semantics of the hierarchal structure. Further, the query 120 may continue to dummy probe the index of mailbox bin 108(1) to obfuscate that the query was searching for the target mailbox. Probing may be repeated for each mailbox bin 108(1)-108(n) containing a target mailbox, or alternatively, this process may be repeated for each mailbox bin 108(1)-108(n) regardless of whether a mailbox bin contains a target mailbox. If probing a mailbox bin that does not contain a target mailbox, probing may include a series of random probes (“dummy probes”) configured to simulate probing that is locating a target mailbox. In some of these implementations, a quantity of times probing is repeated is a predetermined quantity and is performed on each mailbox bin to hide from the server the identity of the mailbox bin containing an additional target mailbox.

After locating the target mailboxes in the mailbox bins 108(1)-108(n), computing device 118 generates a second component of the query 120, query vectors, using the query module 218. For simplicity, implementations using a query vector are discussed, but a query matrix m be substituted for a query vector with matrix multiplication substituted for a dot product computation. The first component of the query 120 may be referred to as the query and the second component of the query 120 may be referred to as the query vector and/or query matrix. As discussed above, query vectors are configured such that, after performing a dot product with the index of the target mailbox bin, unwanted content from the rest of mailboxes stored in the target mailbox bin is filtered away and only content of the target mailbox remains isolated. The computing device 118 sends query vectors targeting a plurality of mailbox bins to the server 104 via the network 116. In some implementations, computing device 118 sends query vectors targeting each of the mailbox bins 108(1)-108(n), sending a dummy query vector to the mailbox bins not containing any target mailboxes. The dummy query, for instance, may have ciphertext entries encrypted to 0 at all entries and may be configured to imitate querying without indexed locations of target mailboxes to find. The server then carries out matrix operations using the matrix operations module 308 to multiply the encrypted entries of the query matrices with entries of the target mailbox bins, the results of which are sent via communication module 306 to computing device 118 via the network 116.

Thus, computing devices 102-1, 102-2, and 102-3 may send a plurality of electronic communications 112 to the second computing device 118 while protecting metadata surrounding the electronic communications 112. The metadata, for instance, relates to identification of a relationship between computing device 102 and computing device 118.

FIG. 4 illustrates an example implementation 400 of mailboxes being stored in a simple list structure in mailbox bins 108(1), 108(2), and 108(n). Mailbox bin 108(1) is configured to store mailboxes having labels between a label of mailbox 402 and a label of mailbox 410. Mailbox bin 108(2) is configured to store mailboxes having labels between a label of mailbox 412 and a label of mailbox 420. Mailbox bin 112-n is configured to store mailboxes having labels between a label of mailbox 422 and a label of mailbox 430. When the quantity of mailboxes 110(1)-110(n) stored in each of the mailbox bins 108(1)-108(n) is low, the illustrated list structure may allow the query 120 to efficiently locate the first mailbox. When using a single layer list structure, a query may locate a target mailbox in a mailbox bin by generating a series of probes directed to the mailbox bin. For example, if a mailbox 404 is the target mailbox and has a label 114 known by the receiving device to be 4, a first probe may be directed at mailbox 408 and return a label of 8. Because the returned label is higher than the label of the target mailbox, a second probe may be directed at mailbox 406 and return a label of 6. Again, because the returned label is higher than the label of the target mailbox, a third probe may be directed at mailbox 404, and return a label of 4. Because this is the label 114 of the target mailbox, the query has located an indexed location of the target mailbox. The query may continue probing additional mailboxes, e.g., 402 and 410, to obfuscate that the query was searching for mailbox 404. In some implementations, a quantity of probes may be equal to a quantity of levels of the binary search tree to obfuscate an identity of the targeted mailbox.

FIG. 5 illustrates an example implementation 500 of one of mailbox bins 108(1)-108(n) being organized into a binary search tree hierarchal structure with the highest levels of the binary search tree including mailboxes 508, 504, 512, 502, 506, 510, and 514. Each level of the binary search tree, for example, is populated with a mailbox label 114 to improve efficiency of the query 120. The highest level of the binary search tree, mailbox 508, has a label 114 that is generally near the median of the mailboxes stored in the mailbox bin shown. A first probe may determine that mailbox 508 has a label of 50. If the query 120 is searching for a mailbox that is alphanumerically lower than 50, the next probe will target the second level of the binary search tree that is lower than 50, mailbox 504. Alternatively, if the query 120 is searching for a mailbox that is alphanumerically higher than 50, the next probe will target the second level of the binary search tree that is higher than 50, mailbox 512. Probing will continue until the target mailbox is located. To hide the identity of the target first mailbox, query 120 may continue probing after the first mailbox is located. For instance, the query 120 is configured to perform n quantity of probes after the target mailbox is located to obfuscate that the target mailbox has already been located. In at least some of these implementations, n is a random quantity.

In some implementations, the query 120 may request a plurality of levels, e.g., the highest levels, of the binary search tree and choose to begin probing at one of mailboxes 502, 506, 510, or 514 based on the highest levels. In the illustrated implementation, requesting the highest levels of the binary search tree may increase efficiency and lower computing costs by replacing two probe iterations. In some implementations, the highest levels of the binary search tree may be defined as a predetermined quantity of highest levels of the binary search tree. In other implementations, the highest levels of the binary search tree may be defined as a predetermined percentage of highest levels of the binary search tree.

FIG. 6 illustrates an example implementation 600 of mailbox bins 108(1) and 108(2) storing mailboxes having messages (electronic communications 112) in binary search tree structures. As illustrated, messages 614, 612, 610, and 604 are found in both mailbox bins. This is a result of computing devices 102-1, 102-2, and/or 102-3 sending electronic messages with at least two labels 114. Each of messages 614, 612, 610, and 604 are stored in two mailbox bins because, even though the messages are identical (after decrypting), the labels 114 are different. As an example of the benefits of sending electronic messages with at least two labels 114, consider an example where messages 610 and 612 are both target messages. Without the messages being stored under two different labels 114, both may be sorted into the same mailbox bin. This would mean that computing device 118 would have to repeat the query to retrieve both messages out of the same mailbox bin. However, repeating the query based on not receiving all of target messages from a mailbox bin may reveal metadata to the server 104.

Therefore, to avoid revealing metadata to the server 104, the query may be repeated a predetermined quantity of times, regardless of how many target mailboxes are located in a single mailbox bin. In at least some implementations, the query is repeated in every mailbox bin the same quantity of times. Alternatively or additionally, as shown in FIG. 6, computing device 118 may determine that it can locate and request message 610 from mailbox bin 108(1) and message 612 from mailbox bin 108(2) in a single query because message 610 is identified by two labels 114. As discussed above, the query may continue to probe the binary search tree after locating a target mailbox and may deliver a query vector to a mailbox bin not containing any target mailboxes to hide the identity of the target mailboxes.

Example Methods

The following discussion describes methods for efficiently exchanging electronic communication securely using multi-retrieval systems and while providing relationship unobservability and protecting other metadata. Aspects of each of the methods may be implemented in hardware, firmware, or software, or a combination thereof. The methods are shown as a set of blocks that specify operations performed by one or more devices and are not necessarily limited to the orders shown for performing the operations by the respective blocks. In portions of the following discussion, reference will be made to FIGS. 1-3.

FIG. 7 depicts method 700 in an example implementation in which a server 104 facilitates multi-retrieval of electronic communications 112 while maintaining privacy of associated metadata. By doing so, users are able to more efficiently, more quickly, and with reduced computing and networking costs, communicate without fear of metadata, including relationships between users, being discoverable by hostile or other unwanted parties.

At operation 702, multiple electronic communications having labels are stored in multiple mailboxes. The server 104, for instance, stores multiple electronic communications 112 having labels 114 in multiple mailboxes. In some implementations, a specified interval of time is defined as a receive phase in which electronic communications 112 may be received by the server 104. As discussed above, each of electronic communications 112 has at least one label 114 and is stored in a mailbox having a corresponding label 114.

At operation 704, the multiple mailboxes are sorted into multiple mailbox bins based on the labels. For example, the server 104 indexes the multiple mailboxes 110(1)-110(n) into multiple mailbox bins based on labels 114. This includes depositing the multiple mailboxes 110(1)-110(n) into mailbox bins, each configured to receive, store, and index mailboxes 110(1)-110(n) within a designated range of labels 114. The designated ranges may be mutually exclusive between mailbox bins. In some implementations, indexing electronic communications 112 may be performed continuously as electronic communications 112 are received. Alternatively, or additionally, indexing may be performed after conclusion of a receive phase. In at least some implementations, the mailboxes within each mailbox bin are indexed into a binary search tree to facilitate efficient querying.

At operation 706 a query is received to locate a plurality of target mailboxes in a plurality of the mailbox bins. Server 104, for instance receives a query 120 from computing device 118 to locate target mailboxes in each of a plurality of the multiple mailbox bins from computing device 118 based on the labels 114 of the target mailboxes. In at least some embodiments, the query includes one or more probes to locate indexed locations of at least one target mailbox in each of a plurality of the multiple mailbox bins. In some implementations, the query 120 includes probing mailbox bins that do not contain a target mailbox to further protect identities of the target mailboxes.

At operation 708, query vectors are received for each of the plurality of mailbox bins to isolate contents of a target mailbox in the mailbox bin. For example, the server 104 receives query vectors from computing device 118 to isolate contents of a target mailbox within each of the plurality of mailbox bins and filter away contents of unwanted mailboxes. This may be accomplished by the matrix operations module 308 performing a dot product of each query vector and the index of the mailbox bin for which it is generated and to which it is directed. Alternatively or additionally, one or more of the query vectors may be in the form of a matrix.

At operation 710, content of a target mailbox stored in each of the plurality of mailbox bins is sent to an external computing device. For instance, the server 104 sends a target mailbox stored in each of the plurality of mailbox bins, including electronic communications 112 to computing device 118 while maintaining privacy of associated metadata. In some implementations, sending of electronic communications 112 is performed at predefined intervals of time.

As illustrated, operations 702-710 may be repeated. For example, the server 104 may be set to repeat method 700 on predetermined intervals. The server 104 may solicit electronic communications from users at repeated operation 702 and solicit a query and query matrix from a user in repeated operations 706 and 708. In some implementations, computing devices 102-1, 102-2, or 102-3 may send dummy messages including a meaningless encrypted electronic communication or a random label 114 in response to being solicited for an electronic communication when computing devices 102-1, 102-2, or 102-3 do not have a genuine electronic message to send.

FIG. 8 depicts method 800 in an example implementation in which a computing device 102-1, 102-2, or 102-3 sends an electronic communication to the server 104 for use in multi-retrieval of electronic communications 112 while maintaining privacy of associated metadata. At operation 802, media content is created or selected for sending as electronic communication. For example, a user interacts with I/O Interfaces 208 to create or select media content for sending as one of electronic communications 112. Electronic communications 112 may include, for example, one or more of text messages, electronic mail, documents, spreadsheets, presentations, images, etc. In some implementations, a user generates at least a part of the electronic communication.

At operation 804, a first label is generated for the electronic communication. For example, a first label 114 for the electronic communication is generated by the label generation module 216. The label generation module 216 may use a pseudorandom operation on a key or secret shared between computing device 102-1, 102-2, or 102-3 and computing device 118.

At operation 806, a second label for the electronic communication is generated. For example, a second label 114 is generated for the electronic communication device by the label generation module 216 to facilitate the server 104 storing the electronic communication in two different mailbox bins according to the two labels 114. In at least some implementations, operation 806 is optional, omission of which would leave the electronic communication with only one label 114 and would result in the electronic message being stored in only one mailbox bin of the server 104.

At operation 808, the electronic communication is encrypted using an encryption key. The electronic communication is encrypted, for example, by the encryption/decryption module 216 of computing device 102-1, 102-2, or 102-3 using an encryption scheme. The encryption scheme may use the key or secret shared between computing device 102-1, 102-2, or 102-3 and computing device 118.

At operation 810, the electronic communication is sent to the server to be indexed based on at least the first label. For example, computing device 102-1, 102-2, or 102-3 may send the electronic communication to the server 104 to be indexed based on at least the first label 114. In implementations where two or more labels are generated for the electronic communication, e.g., at operation 806, the electronic communication may be delivered to the server to be indexed based on each of the two or more labels. Alternatively or additionally, the electronic communication may be duplicated by a sending computing device, e.g., electronic device 102, and sent as two or more electronic communications, each having a unique label. As illustrated, method 800 may be repeated. In some implementations, method 800 may be repeated multiple times as computing device 102 sends multiple electronic communications during a sending phase.

FIG. 9 depicts method 900 in an example implementation in which a computing device 118 locates and receives electronic communications 112 within a plurality of mailbox bins while protecting metadata associated with the delivery of the electronic communication 112 from a sending device 102-1, 102-2, or 102-3 to computing device 118.

At operation 902, mailbox storage information of a server is received. The computing device 118, for example, receives mailbox storage information of the server 104. Mailbox storage information may include label ranges for each of multiple mailbox bins of the server 104, mailbox bin indexing structures of the multiple mailbox bins, and/or a quantity of mailboxes stored within each of the mailbox bins.

At operation 904, the server is queried to locate an electronic communication in each of a plurality of mailbox bins of the server. For example, computing device 118 queries the server 104, using methods described herein. The querying, for instance, includes probing to locate an electronic communication in each of a plurality of the multiple mailbox bins.

At operation 906, a query matrix is delivered for each of the plurality of mailbox bins to the server to isolate content of a target electronic communication within each of the plurality of mailbox bins. For example, computing device 118 delivers a query matrix (or query vector) to the server 104 for each of a plurality of the multiple mailbox bins configured to secretly isolate content of a plurality of target electronic communications within each of the plurality of mailbox bins, according to methods described herein.

At operation 908, the contents of the target electronic communications are received from the server. The computing device 118, for instance, receives the contents of the target electronic communications 112 from the server 104. In some implementations, computing device 118 is only able to receive one target electronic communication in each mailbox bin during a receiving phase. This may result in computing device 118 being unable to receive each target electronic communication in a single receive phase. Therefore, in some implementations, electronic communications remain for a predetermined quantity of send and receive phases so that computing device 118 may retrieve an electronic communication that it was unable to receive during a previous receive phase. Alternatively, or additionally, steps 804 through 808 may be repeated within a single receive phase. However, to protect an identity, or prevent a narrowing of a group of possible identities of a target electronic communication, steps 804 through 808 may be repeated for each mailbox bin of the server 104 and may be repeated a predetermined quantity of times during every receive phase.

Example System and Device

FIG. 10 illustrates an example system generally at 1000 that includes an example computing device 1002, which is representative of one or more computing systems and/or devices that may implement the various techniques described herein. This is illustrated through inclusion of private information retrieval (“PIR”) module 1014, which may be configured to implement one or more operations performed by computing device 102-1, 102-2, or 102-3, the server 104 or computing device 118 relating to computing device 102-1, 102-2, or 102-3 securely sending electronic communication 112 to the second computing device 118 without the server 104 being aware of the contents of the electronic communication 112 or being aware of a communication relationship of the first computing device 102 and the second computing device 118 as discussed herein. For example, PIR module 1014 may be representative of encryption/decryption module 214, query module 216, mailboxes index 108, or matrix operations module 308.

Computing device 1002 may be, for example, a server of a service provider, a device associated with a client (e.g., a client device), an on-chip system, and/or any other suitable computing device or computing system.

Computing device 1002 as illustrated includes a processing system 1004, one or more computer-readable media 1006, and one or more I/O interfaces 908 that are communicatively coupled, one to another. Although not shown, computing device 1002 may further include a system bus or other data and command transfer system that couples the various components, one to another. A system bus can include any one or combination of different bus structures, such as a memory bus or memory controller, a peripheral bus, a universal serial bus, and/or a processor or local bus that utilizes any of a variety of bus architectures. A variety of other examples are also contemplated, such as control and data lines.

Processing system 1004 is representative of functionality to perform one or more operations using hardware. Accordingly, processing system 1004 is illustrated as including hardware elements 1010, which may be configured as processors, functional blocks, and so forth. This may include implementation in hardware as an application-specific integrated circuit or other logic device formed using one or more semiconductors. Hardware elements 1010 are not limited by the materials from which they are formed or the processing mechanisms employed therein. For example, processors may be comprised of semiconductor(s) and/or transistors (e.g., electronic integrated circuits (ICs)). In such a context, processor-executable instructions may be electronically-executable instructions.

Computer-readable storage media 1006 is illustrated as including memory/storage 1012. Memory/storage 1012 represents memory/storage capacity associated with one or more computer-readable media. Memory/storage 1012 may include volatile media (such as random access memory (RAM)) and/or nonvolatile media (such as read only memory (ROM), Flash memory, optical disks, magnetic disks, and so forth). Memory/storage 1012 may include fixed media (e.g., RAM, ROM, a fixed hard drive, and so on) as well as removable media (e.g., Flash memory, a removable hard drive, an optical disc, and so forth). Computer-readable media 1006 may be configured in a variety of other ways as further described below.

Input/output interfaces 1008 are representative of functionality to allow a user to enter commands and information to computing device 1002, and also allow information to be presented to the user and/or other components or devices using various input/output devices. Examples of input devices include a keyboard, a cursor control device (e.g., a mouse), a microphone, a scanner, touch functionality (e.g., capacitive or other sensors that are configured to detect physical touch), a camera (e.g., which may employ visible or non-visible wavelengths such as infrared frequencies to recognize movement as gestures that do not involve touch), and so forth. Examples of output devices include a display device (e.g., a monitor or projector), speakers, a printer, a network card, tactile-response device, and so forth. Thus, computing device 1002 may be configured in a variety of ways as further described below to support user interaction.

Various techniques may be described herein in the general context of software, hardware elements, or program modules. Generally, such modules include routines, programs, objects, elements, components, data structures, and so forth that perform particular tasks or implement particular abstract data types. The entities described herein (e.g., PIR module 1014) generally represent software, firmware, hardware, or a combination thereof. The features of the techniques described herein are platform-independent, meaning that the techniques may be implemented on a variety of commercial computing platforms having a variety of processors.

An implementation of the described entities and techniques may be stored on or transmitted across some form of computer-readable media. The computer-readable media may include a variety of media that may be accessed by computing device 1002. By way of example, and not limitation, computer-readable media may include “computer-readable storage media” and “computer-readable signal media.”

“Computer-readable storage media” may refer to media and/or devices that enable persistent and/or non-transitory storage of information in contrast to mere signal transmission, carrier waves, or signals per se. Thus, computer-readable storage media refers to non-signal bearing media. The computer-readable storage media includes hardware such as volatile and non-volatile, removable and non-removable media and/or storage devices implemented in a method or technology suitable for storage of information such as computer readable instructions, data structures, program modules, logic elements/circuits, or other data. Examples of computer-readable storage media may include, but are not limited to, RAM, ROM, EEPROM, flash memory or other memory technology, CD-ROM, digital versatile disks (DVD) or other optical storage, hard disks, magnetic cassettes, magnetic tape, magnetic disk storage or other magnetic storage devices, or other storage device, tangible media, or article of manufacture suitable to store the desired information and which may be accessed by a computer.

“Computer-readable signal media” may refer to a signal-bearing medium that is configured to transmit instructions to the hardware of computing device 1002, such as via a network. Signal media typically may embody computer readable instructions, data structures, program modules, or other data in a modulated data signal, such as carrier waves, data signals, or other transport mechanism. Signal media also include any information delivery media. The term “modulated data signal” means a signal that has one or more of its characteristics set or changed in such a manner as to encode information in the signal. By way of example, and not limitation, communication media include wired media such as a wired network or direct-wired connection, and wireless media such as acoustic, RF, infrared, and other wireless media.

As previously described, hardware elements 1010 and computer-readable media 1006 are representative of modules, programmable device logic and/or fixed device logic implemented in a hardware form that may be employed in some implementations to implement at least some aspects of the techniques described herein, such as to perform one or more instructions. Hardware may include components of an integrated circuit or on-chip system, an application-specific integrated circuit (ASIC), a field-programmable gate array (FPGA), a complex programmable logic device (CPLD), and other implementations in silicon or other hardware. In this context, hardware may operate as a processing device that performs program tasks defined by instructions and/or logic embodied by the hardware as well as a hardware utilized to store instructions for execution, e.g., the computer-readable storage media described previously.

Combinations of the foregoing may also be employed to implement various techniques described herein. Accordingly, software, hardware, or executable modules may be implemented as one or more instructions and/or logic embodied on some form of computer-readable storage media and/or by one or more hardware elements 1010. Computing device 1002 may be configured to implement particular instructions and/or functions corresponding to the software and/or hardware modules. Accordingly, implementation of a module that is executable by computing device 1002 as software may be achieved at least partially in hardware, e.g., through use of computer-readable storage media and/or hardware elements 1010 of processing system 1004. The instructions and/or functions may be executable/operable by one or more articles of manufacture (for example, one or more computing devices 1002 and/or processing systems 1004) to implement techniques, modules, and examples described herein.

The techniques described herein may be supported by various configurations of computing device 1002 and are not limited to the specific examples of the techniques described herein. This functionality may also be implemented all or in part through use of a distributed system, such as over a “cloud” 1016 via a platform 1018 as described below.

Cloud 1016 includes and/or is representative of platform 1018 for resources 1020. Platform 818 abstracts underlying functionality of hardware (e.g., servers) and software resources of cloud 1016. Resources 1020 may include applications and/or data that can be utilized while computer processing is executed on servers that are remote from computing device 1002. Resources 1020 can also include services provided over the Internet and/or through a subscriber network, such as a cellular or WI-FI® network.

Platform 1018 may abstract resources and functions to connect computing device 1002 with other computing devices. Platform 1018 may also serve to abstract scaling of resources to provide a corresponding level of scale to encountered demand for resources 1020 that are implemented via platform 1018. Accordingly, in an interconnected device implementation, implementation of functionality described herein may be distributed throughout system 1000. For example, the functionality may be implemented in part on computing device 1002 as well as via platform 816 that abstracts the functionality of cloud 1014.

In the discussions herein, various different embodiments are described. It is to be appreciated and understood that each embodiment described herein can be used on its own or in connection with one or more other embodiments described herein. Further aspects of the techniques discussed herein relate to one or more of the following embodiments.

A system for sending secure electronic communications, the system comprising: at least one processor; and at least one computer-readable storage medium storing instructions that are executable by the at least one processor to perform operations including: storing multiple target electronic communications in multiple target mailboxes, each of the multiple target electronic communications having two labels and being stored into two distinct target mailboxes having corresponding labels; sorting the multiple target mailboxes, based on labels, into multiple mailbox bins and indexing locations of mailboxes stored therein; receiving a query including one or more probes to locate indexed locations of at least one target mailbox in each of a plurality of the multiple mailbox bins, and a query vector of encrypted entries to isolate content of at least one of the target mailboxes in each of the plurality of the mailbox bins; and sending content of a target mailbox stored in each of the plurality of mailbox bins to an external computing device.

Alternatively or in addition to any of the above described systems, any one or combination of: wherein at least one of the plurality of mailbox bins is indexed into a hierarchical data structures and wherein levels of the hierarchical data structure independently queryable; wherein the operations further include, prior to said sending, delivering labels of a plurality of levels of the hierarchal data structure of one or more of the multiple mailbox bins to the external computing device; wherein at least one of the plurality of mailbox bins is indexed according to alphanumerical ordering; wherein the query comprises a series probes, each of which is determined based in part on a result of a previous probe, to locate the at least one target mailbox in each of the plurality of the multiple mailbox bins; wherein the system comprises one or more untrusted servers or untrusted internet service providers; wherein a quantity of mailbox bins is equal to a quantity of electronic communications that the external computing device is permitted to receive from the system in a single receiving phase; wherein the operations further include sending to the external computing device information about one or more of the multiple mailbox bins including one or more of label ranges, quantity of messages, and storage structure of the mailbox bin.

A method for secure electronic communication, the method comprising: querying a server, the query configured to find an indexed location of a target mailbox in each of a plurality of multiple mailbox bins; delivering to the server a query vector generated for each of the plurality of mailbox bins, the query vector configured with entries to isolate content of the target mailbox a mailbox bin; receiving content of the target mailbox from each of the plurality of the mailbox bins.

Alternatively or in addition to any of the above described systems, any one or combination of: the method further comprising determining a mailbox content retrieval strategy based on label ranges of the plurality of mailbox bins; further comprising receiving mailbox storage information from the server including one or more of label ranges, quantity of messages, and storage structure of the plurality of mailbox bins; the query further comprising a random query directed to a mailbox bin not containing a target mailbox, the random query to probe the mailbox bin to imitate attempting to find an indexed location of a target mailbox; further comprising sending a query vector to the mailbox bin wherein the query vector is configured with entries encrypted to 0 at all entries; further comprising, after a predetermined interval: dummy querying the server, the dummy query imitating querying without indexed locations of target mailboxes to find; and delivering to the server, a dummy query vector of ciphertext entries encrypted to 0 at all entries; wherein querying, delivering, and receiving are repeated to retrieve a second target mailbox from a mailbox bin from which a first target mailbox was already retrieved; wherein the querying, delivering, and receiving are repeated in each of the plurality of mailbox bins to protect metadata relating to a label range of the second target mailbox; wherein querying further comprises performing joint tree reversal to find an indexed location of at least one target mailbox in at least one of the multiple mailbox bins.

A method for sending multiple secure electronic communications, the method comprising: storing multiple target electronic communications in multiple target mailboxes, each of the multiple target electronic communications having two labels and being stored into two distinct target mailboxes having corresponding labels; sorting the multiple target mailboxes, based on labels, into multiple mailbox bins and indexing locations of mailboxes stored therein; receiving a query including one or more probes for a plurality of the multiple mailbox bins to locate a target mailbox of a plurality of mailbox bins based on the labels of the target mailboxes and a set of query vectors, each of the set of query vectors directed to one of the plurality of mailbox bins, to isolate a target mailbox in the one of the plurality of mailbox bins; and sending contents of the target mailboxes stored in each of the plurality of mailbox bins to an external computing device.

Alternatively or in addition to any of the above described methods, any one or combination of: wherein the locations of mailboxes stored in the multiple mailbox bins are indexed into binary search trees and wherein, prior to receiving the one or more probes of the query, labels of a plurality of levels of the binary search trees are sent to the external computing device; further comprising encoding one or more bin contents via a sub-cube batch code.

Although the subject matter has been described in language specific to structural features and/or methodological acts, it is to be understood that the subject matter defined in the appended claims is not necessarily limited to the specific features or acts described above. Rather, the specific features and acts described above are disclosed as example forms of implementing the claims.

CONCLUSION

Although the implementations have been described in language specific to structural features and/or methodological acts, it is to be understood that the implementations defined in the appended claims are not necessarily limited to the specific features or acts described. Rather, the specific features and acts are disclosed as example forms of implementing the claimed implementations. 

What is claimed is:
 1. A system for obscuring mailbox identity during transmission of electronic communications, the system comprising: at least one processor; and at least one computer-readable storage medium storing instructions that are executable by the at least one processor to perform operations including: storing target electronic communications across multiple mailboxes, wherein the target electronic communications having one or more encrypted labels; sorting the multiple mailboxes into multiple mailbox bins that each comprise a label range for the multiple mailboxes, wherein the sorting comprises indexing locations of the multiple mailboxes within the multiple mailbox bins based on the one or more encrypted labels; receiving a query including: one or more probes to locate indexed locations of at least one target mailbox in each of the multiple mailbox bins, and a query vector that comprises encrypted entries to isolate content from the at least one target mailbox in each of the multiple mailbox bins; and sending the isolated content of the at least one target mailbox in each of the multiple mailbox bins to an external computing device based on processing of the query vector.
 2. A system as in claim 1, wherein at least one of the multiple mailbox bins is indexed into a hierarchical data structures and wherein levels of the hierarchical data structure are independently queryable to identify the indexed locations of the at least one target mailbox.
 3. A system as in claim 2, wherein the operations further include, prior to said sending, delivering labels of a plurality of levels of the hierarchal data structure of one or more of the multiple mailbox bins to the external computing device.
 4. A system as in claim 1, wherein at least one of the multiple mailbox bins is indexed according to alphanumerical ordering.
 5. A system as in claim 1, wherein the query comprises a series probes, each of which is determined based in part on a result of a previous probe, to locate the at least one target mailbox in each of the multiple mailbox bins.
 6. A system as in claim 5, wherein the system comprises one or more untrusted servers or untrusted interne service providers.
 7. A system as in claim 1, wherein a quantity of mailbox bins is equal to a quantity of electronic communications that the external computing device is permitted to receive from the system in a single receiving phase.
 8. A system as in claim 1, wherein the operations further include sending, to the external computing device, information about one or more of the multiple mailbox bins including one or more of label ranges, quantity of messages, and storage structure of respective multiple mailbox bins.
 9. A method for obscuring mailbox identity during transmission of electronic communications, the method comprising: storing target electronic communications across multiple mailboxes, wherein the target electronic communications having one or more encrypted labels; sorting the multiple mailboxes into multiple mailbox bins that each comprise a label range for the multiple mailboxes, wherein the sorting comprises indexing locations of the multiple mailboxes within the multiple mailbox bins based on the one or more encrypted labels; receiving a query including: one or more probes for searching the multiple mailbox bins to locate a target mailbox of in each of the multiple mailbox bins, and a set of query vectors, each query vector directed to one of the multiple mailbox bins, to isolate content in target mailboxes in the multiple mailbox bins; and sending contents of the target mailboxes to an external computing device based on processing of the set of query vectors.
 10. A method as in claim 9, wherein locations of the target mailboxes stored in the multiple mailbox bins are indexed into binary search trees and wherein, prior to receiving the one or more probes of the query, labels of a plurality of levels of the binary search trees are sent to the external computing device.
 11. A method as in claim 9, further comprising encoding one or more bin contents of the multiple mailbox bins via a sub-cube batch code.
 12. A method as in claim 9, wherein at least one of the multiple mailbox bins is indexed into a hierarchical data structures, and wherein levels of the hierarchical data structure are independently queryable to identify the indexed locations of the at least one target mailbox.
 13. A method as in claim 9, wherein the query comprises a series probes, each of which is determined based in part on a result of a previous probe, to locate the at least one target mailbox in each of the multiple mailbox bins.
 14. A method as in claim 9, wherein at least one of the multiple mailbox bins is indexed according to alphanumerical ordering.
 15. A method as in claim 9, further comprising: sending, to the external computing device, information about one or more of the multiple mailbox bins including one or more of label ranges, quantity of messages, and storage structure of respective multiple mailbox bins.
 16. A method as in claim 9, further comprising: prior to said sending, delivering labels of a plurality of levels of the hierarchal data structure of one or more of the multiple mailbox bins to the external computing device.
 17. A computer storage device storing computer-executable instructions that, when executed by at least one processor, causes the at least one processor to execute a method for obscuring mailbox identity during transmission of electronic communications, the method comprising: storing target electronic communications across multiple mailboxes, wherein the target electronic communications having one or more encrypted labels; sorting the multiple mailboxes into multiple mailbox bins that each comprise a label range for the multiple mailboxes, wherein the sorting comprises indexing locations of the multiple mailboxes within the multiple mailbox bins based on the one or more encrypted labels; receiving a query including: one or more probes for searching the multiple mailbox bins to locate a target mailbox in each of the multiple mailbox bins, and a set of query vectors, each query vector directed to one of the multiple mailbox bins, to isolate content in target mailboxes in the multiple mailbox bins; and sending contents of the target mailboxes to an external computing device based on processing of the set of query vectors.
 18. The computer storage device as in claim 17, wherein at least one of the multiple mailbox bins is indexed into a hierarchical data structures, and wherein levels of the hierarchical data structure are independently queryable to identify the indexed locations of the at least one target mailbox.
 19. A computer storage device as in claim 17, wherein the query comprises a series probes, each of which is determined based in part on a result of a previous probe, to locate the at least one target mailbox in each of the multiple mailbox bins.
 20. A computer storage device as in claim 17, wherein at least one of the multiple mailbox bins is indexed according to alphanumerical ordering. 